Introduction

The corpus that will be used for this project is the top 50 of multiple countries on Spotify that has an available top 50. This is an interesting corpus because it can show the differences and similarities between popular music in many different places across the world. The comparison points can be either countries, regions, or continents. This flexibility is another advantage of the corpus. The limitations of this corpus are the fact that recent releases or events can influence music worldwide and can push music to the top of many top 50 lists while not saying much about the differences between places (such as the recent release of Kanye West having multiple songs in multiple different top 50 lists). Another limitation is the fact that multiple countries might not use Spotify as their main source of music, so this could influence regional trends in a way that makes them not representative of the population of the country as a whole. A final weakness is the fact that this corpus changes daily, so it can be hard to look into individual tracks since they will probably be gone after a certain time. Finally, it is difficult to pick specific interesting tracks in this corpus due to the sheer number of tracks (3600) in it. But some that may be interesting at first glance are CARNIVAL (Kanye West), Cruel Summer (Taylor Swift), Unwritten (Natasha Bedingfield) and I Wanna Be Yours (Arctic Monkeys). This Kanye West song and many others of his (very) recent album are interesting because they can be found in nearly every top 50 list. The other three songs are mostly interesting because they are older songs that are still found in multiple top 50 lists across the corpus.

Differences between Pop Music in Different Regions


There are a bunch of violin plots that show the distribution of different Spotify statistics sorted from the highest mean to the lowest mean. This shows a lot about the differences between countries (and continents potentially).

Chroma analysis of an outlier


This is a chroma graph of an outlier in the dataset: “Cruel Summer” by Taylor Swift, which is one of the oldest songs which is found in multiple top 50 lists.

Structure similarity matrix of an outlier


These are two self similarity matrices of “Cruel Summer” by Taylor Swift, these similarity matrices show both Chroma and Timbre.